Speaker Diarization Features: The UPM Contribution to the RT09 Evaluation

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

EURECOM submission to the Albayzin 2016 Speaker Diarization Evaluation

This paper describes the speaker diarization system submitted by EURECOM for the Albayzin 2016 speaker diarization evaluation. This evaluation consists of segmenting broadcast audio documents according to different speakers and attributing those segments to the speaker who uttered them, without any prior information about the speaker identities nor their number. EURECOM system is based on the b...

متن کامل

Improved location features for meeting speaker diarization

This paper proposes several improvements to the correlationbased location features recently used in meeting speaker diarization. A speech-specific alternative to the generalized cross correlation phase transform (GCC-PHAT) algorithm is tested and shown to provide equal or better results without noise reduction or continuity-enforcing smoothing. The limitations of a single correlation reference ...

متن کامل

Modulation spectrogram features for improved speaker diarization

We propose the use of modulation spectrogram features in speaker diarization. These features carry longer term characteristics of the acoustic signals than the widely used MFCCs, thus providing potential improvement by using both features in combination. Using the state-of-the-art ICSI speaker diarization system, an improvement of 20.77% relative DER is obtained on the NIST Rich Transcription 2...

متن کامل

Speaker Diarization for Conference Room: The UPC RT07s Evaluation System

In this paper the authors present the UPC speaker diarization system for the NIST Rich Transcription Evaluation (RT07s) [1] conducted on the conference environment. The presented system is based on the ICSI RT06s system, which employs agglomerative clustering with a modified Bayesian Criterion (BIC) measure to decide which pairs of clusters to merge and to determine when to stop merging cluster...

متن کامل

native speaker norms and teaching english to non_native speakers : the case of iranian efl learners

امروزه، این که زبان انگلیسی سریع ترین و گسترده ترین زبان مورد استفاده در سراسر جهان است به عنوان یک واقعیت پذیرفته شده است. استفاده مشترک از زبان انگلیسی به عنوان یک زبان بین المللی مستلزم هنجارها و مدل های یادگیری و تدریس زبان است. زبان شناسان توجه ویژه ای به مفهوم "زبان مادری" به عنوان تنها منبع درست و قابل اعتماد از داده های زبان می داده اند.با این حال، این اصطلاح به اندازه کافی روشن به نظر ...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Audio, Speech, and Language Processing

سال: 2012

ISSN: 1558-7916,1558-7924

DOI: 10.1109/tasl.2011.2159971